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A paper by Shier (J. Res NHS SOB) shows how to partition the graph of a matrix into a tree so as to minimize the 
number of operations required to invert the matrix. The present paper shows how to economically solve a sparse 
System of linear equations after the application oi Shier*S method to the coefficient matrix. 
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1 . Introduction 

In I 1 1' Shier points out thai if: (a) the graph corresponding to a sparse matrix 1 is partitioned into subgraphs 

which themselves can he regarded as nodes of a tree, and (b) the nodes of this tree are suitably numbered; 
then A can be partitioned as (Ay) where A y are submatrices and the /lh row of 1 is 

Ai = (A iu • ^A iu 0. 0. • • •, 0. ,1, ,,<,,. 0, • • •,()), i = 1, • • -,n (1) 

and where node r(i) is the "father" of i in the tree. Also. 1 , A . = ( k < i) unless r(k) = i and A is block 
incidence-symmetric He then describes a relatively efficient way of finding A l , involving the computation 
of/1;,- -1 and similar sub-matrices by standard methods for dense matrices combined with reclusive application 
of his algorithm. He also describes a method for earn ing out the tree partitioning in (a) above. 

Unfortunately he does not describe in detail how his method can be applied to the much more common 
problem of solving sparse equations, although he does mention (p. 252, lines 3-5) that it can be so applied. 
This will now be done. 

2. Solution of Equations 

We have to solve: 

Ax = b (2) 

where A is partitioned as in (1) and x and b are partitioned conformably into sub-vectors 

fe) and (/>,-) (3) 

We may very efficiently solve (2) by Block Gaussian Elimination as follows: 
(A) (Elimination of sub-diagonal sub-matrices) 
For i = 1 , ••-,//— 1 do: 

(I) Form multipliers m rii) j = A r{ DjAif 1 (4) 



* An invited paper. 

** Present address: Department of Computer Science and Mathematics, York University, Downsview, Ontario M3J 2R7. 

1 Figures in brackets indicate the literature references at the end of this paper. 

485 



Then (6) can be replaced by: 



Eliminate A r( ,-),/ by subtracting m^^x (row i) from row r(i), i.e. 

(II) Update i4 r( ,-) jr( t -> ^ ^r( O, r( O — ^rO'U^n ^M,r(i) (5) 

(III) Update 6r<««-*r<o " ^(O.Ai" 1 ** ( 6 ) 

(B) (Back-Substitution) 

(IV) x n =A nn ~\ n (7) 

(V)Fori =n - 1, », 1 do: 

%i ~ An iki — A ijr(i) x r( i)j (8) 

(The above simply constitutes Gaussian Elimination with coefficients consisting of submatrices instead of 
scalars.) The great advantage of this method is that there is no fill-in except within the blocks, i.e., a zero 
submatrix always remains zero. 

3. A More Economical Method 

Further economy can be obtained by omitting the explicit calculation of m in (4). Rather we can perform 
triangular decomposition 

A ii = L ii U ii (9) 

(I), Solve Lum = b t (10) 

(II) Solve Uun = v t (11) 
Then Zi =A ii ~ 1 b i (12) 

(III) Form w r(n = A r(i))i Zi (13) 

(IV) 6 r(f) *-b rii) - w r{i) (14) 

(5) Can be replaced by similar calculations with each column o(A i>rii) taking the place, in turn, of b{. 

(7) Can be replaced by (9), (10), (11) with i = n. 

(8) Can be replaced by: 

(I) ki *~ ki ~~ Ai, r u)Xr(i) (15) 

(II) Solve L u vi = b t (16) 

(III) Solve UuXi = (v)i (17) 

4. Operation Count 

(1) If the explicit inverse A # _1 and m r(ihi are employed as in §2, using dense matrix techniques, the 
operation count would be as follows, assuming A H is order Pi*Pi, and/?, = p for all i: the formation of 
A if 1 , m r( i hi and equation (5) each require 0(p 3 ) multiplications, for a total of 0(3p 3 ); while eq (6) 
requires 0(p 2 ), (7) requires 0(p 3 + p 2 ) and (8) requires 0(2p 2 ). Thus, the total number of multiplications 
is approximately 
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(n - L)3/> 3 + (// - L)3p 2 + /> :{ + p 2 = (3n - 2)(/, :{ + /; 2 ). (18) 

(II) If the method of §3 is used we have: equation ( ( >) requires 0(p /3) multiplications; equations (10), (11) 
and (13) together need 0(2p ). The solution of equations ( 10), (II) and ( L3) with any column of A.i, r {i) 
in place of bj requires 0(2/> J ) for each column, i.e. 0(2/; ) in all. Equations (15)— (17) require 0(2p 2 ). 

. . . (p 3 A 

Equations (9), (10) and (II) fori — // require 01 ■ — h p J. I hns. the total number oi multiplications 
required for this method is approximately 

7 -n-2\p 3 + (4n-3)p i (19) 

Thus the method of §3 is more efficient for large pi = p, when we may ignore multiplicative and 
overhead factors. 

(Ill) If the equations are solved directly without any partitioning, as if they were full, the number of 
multiplications required is ~ ~(np) + (///>)"% which for large n is much greater than r n P :i - 

5. Labelling of Tree Nodes 

The nodes of the tree must be numbered in such a wax that its incidence matrix has the form (1). This can 
be accomplished for example by a modification of the "Reverse Cuthill-McKee Algorithm" [2]. (This was 
originally devised as a hand-width minimization technique, although that aspect has no relevance in the 
present context.) Simplified and re-worded for our purposes the algorithm may be described thus: 

A. Suppose there are /V nodes in the tree. Choose an arbitrary node and number it A (this is defined as the 
only member of "level" 1). Set/ = I and./ = IS — I. 

B. Consider all nodes adjacent to nodes in level / but as yet unnumbered (they will be defined as members of 
level / + 1). Suppose there are K such nodes in all. If A. = terminate. Otherwise assign to them the 
numbers /, J — L, ••, / — K + 1. 

Set J =/- A and / = /+ I. 

Repeat step H until k — 0. 
It is simple to prove that the incidence matrix of a tree thus numbered has the form (1), i.e. each node 
(numbered i, say) is adjacent to only one node having a higher number (say /(/)). 

PROOF: Suppose if possible a node numbered i is adjacent to two nodes numbered i ', and r 2 such that r x > i, 
r 2 > i. Then the nodes rj and r 2 belong to lower levels than node i. Hence they are both connected, via paths 
not including node i, to node N . Thus we have two separate paths connecting nodes i and /V, i.e. we have a 
loop. But this contradicts the assumption that the graph is a tree. Hence there must be only one node adjacent 
to i with number > i. Q.E.D. 
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